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Abstract 

A new communication scheme for Gaussian parallel relay networks based on superposition coding and partial 
decoding at the relays is presented. Some specific examples are proposed in which two codebook layers are 
superimposed. The first level codebook is constructed with symbols from a binary or ternary alphabet while the 
second level codebook is composed of codewords chosen with Gaussian symbols. The new communication scheme 
is a generalization of decode-and-forward, amplify-and-forward, and bursty-amplify-and-forward. The asymptotic 
low SNR regime is studied using achievable rates and minimum energy-per-bit as performance metrics. It is shown 
that the new scheme outperforms all previously known schemes for some channels and parameter ranges. 

I. Introduction 

Cooperation in wireless networks is often modeled through relay networks. In these models one or more source 
nodes communicate with one or more destination nodes with the help of intermediate relay nodes. Relay network 
models are of relevance in wireless sensor networks where the sensor nodes have limited transmission power 
capabilities and communication range. In this work we study a special type of relay network called the diamond 
network for the network with two relays and parallel relay network for the general case with more than two relays, 
introduced by Schein and Gallager [H, (U. This is a single-source single-destination layered network model with 
three layers, namely the source layer, relay layer, and destination layer, with the property that the nodes in each 
layer can only communicate with nodes in the next layer. The communication from the source to the relays takes 
place over a broadcast channel (BC) while the communication from the relays to the destination takes place over 
a multiple access channel (MAC). Note that in this simple model there is no direct communication path from the 
source to the destination. We focus on the Gaussian case, in which the transmitted signals are real and subject to an 
average power constraint, and the received signals are affected by channel attenuation and additive white Gaussian 
noise. 

Despite the simplicity of the model, its capacity, that is, the maximum reliable communication rate from source 
to destination, is in general unknown. The best known capacity upper bound is based on the cut-set bound. 
Many communication strategies have been proposed, such as decode-and-forward (DF), compress-and-forward, 
amplify-and-forward (AF), bursty-amplify-and-forward (BAF), rematch-and-forward, and combinations thereof, 
leading to various capacity lower bounds. Depending on the channel gains and signal to noise ratios (SNR), 
some communication schemes perform better than others. Many wireless sensor networks operate in the low SNR 
regime where power is a scarce resource and the main performance limiter. As a result, it is of interest to design and 
evaluate communication schemes that exhibit good performance at low SNR. We study the performance of various 
communication schemes in the asymptotic regime of SNR going to zero using as performance metrics channel 
capacity and minimum energy-per-bit (i.e. minimum required energy to communicate 1 bit of information). 

The first two communication schemes considered for parallel relay networks are DF and AF. In DF all the relays 
decode the source message and retransmit it to the destination achieving a beamforming gain. DF exhibits good 
performance when the links from the source to the relays are sufficiently stronger than the links from the relays to 
the destination. However, when the links from the source to the relays are relatively weak compared to the links 
from the relays to the destination, the requirement of decoding the source message at all relays is too restrictive. 
In AF the relays amplify the received signals with certain gains making no attempt to decode the source message. 
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While amplifying the received signals the relays also amplify the received noise. While in some regimes AF can 
achieve the channel capacity, at low SNR the relays use most of their available power amplifying the received 
noise, and the resulting performance of AF is poor. 

An approach to improve the performance of AF in the low SNR regime is to remain silent a fraction of the 
symbol times and communicate in bursts to increase the effective SNR of the signal transmitted during the bursts. 
This communication scheme, denoted BAF and proposed in O, achieves better rates than DF when the links from 
the relays to the destination are sufficiently stronger than the links from the source to the relays. In BAF, the source, 
relays, and destination know in advance the burst pattern, which is usually a contiguous block of symbols. As a 
result, the burst pattern does not carry any information. 

In this work we propose a new communication scheme tailored to the low SNR regime which we denote 
superposition-partially-decode-and-forward (SPDF). In the binary form of our proposed SPDF scheme, we generate 
a binary codebook (i.e. with symbols chosen from {0, 1}) of sparse binary patterns, and encode part of the source 
message through the selection of a codeword from this codebook. The remaining part of the source message is 
encoded through an independent codeword, transmitted in the symbol positions where the binary codeword has 
ones. The relays then decode the binary codeword and amplify the received signal only in the time indices where 
the binary codeword has ones. Finally the destination jointly decodes both codewords. We see that SPDF leverages 
the benefits of BAF of increasing the symbol SNR to avoid excessive noise amplification, while achieving some 
additional rate by conveying information through the binary codeword. As will be shown in Sections III and [V] the 
binary SPDF scheme can be generalized in various ways. 

We characterize the rates that can be achieved with the SPDF scheme, and analyze its performance in the 
asymptotic regime of SNR— > for parallel relay networks with some symmetry properties. We also study the energy- 
per-bit achievable with SPDF and obtain a minimum energy-per-bit characterization within a constant multiplicative 
factor for symmetric Gaussian diamond networks. Our results show that the new communication scheme matches 
or exceeds the performance of all previously known communication schemes in the low SNR regime. 



The remainder of the paper is organized as follows. Subsections I-A and I-B below, present related literature 



and the notation used in the rest the paper. In Section |TT] we describe the communication model that we will use 



throughout the paper. Section III presents the achievable rates with known communication schemes and with the 
new proposed SPDF communication scheme. In Section[lV]cut-set upper bounds for various networks are presented, 
which will be used in later sections to evaluate the performance of the different achievable rates. In Section [V] 
we analyze the performance of the existing and new communication schemes in the asymptotic low SNR regime 
through a capacity formulation. Section IVl] shows how the asymptotic low SNR capacity characterizations can 



be used to obtain minimum energy-per-bit performance bounds. Finally in Section VII we give some concluding 
remarks. The proofs of the results are given in the appendices. 



A. Related work 

To put our work into context, we briefly discuss related results which provide various characterizations of the 
fundamental communication limits in relay networks. While in general the upper and lower capacity bounds for 
relay networks do not match, the additive gap between them has been shown to be upper bounded by a constant 
that is independent of the channel gains and power constraints and that only depends on the number of nodes in the 
network O, HQ For the symmetric iV-relay Gaussian parallel network, the constant additive gap characterization 
has been improved in [8] to 1.8 bits, independent of the number of relays N. Capacity characterizations within 
a constant additive gap are of practical relevance in moderate to high signal to noise ratio (SNR) regimes and in 
networks of small size, in which the constant additive gap is small compared to the achievable communication 
rates. 

Many wireless sensor networks operate in the low SNR regime, for which a capacity approximation within a 
constant multiplicative factor, instead of within a constant additive gap, is more appropriate. For general relay 

'This constant additive gap capacity characterization applies to general single source relay networks. 
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Fig. 1. Communication model for the Gaussian iV-relay parallel network. 

networks with single-source multicast Avestimehr et al. [3] derived a capacity approximation within a factor of 
2d(d + 1) where d is the maximum node degree in the network. In the specific case of a parallel relay network the 
maximum degree is d = N so the above result becomes 2N{N + \). Recently, this capacity characterization within 
a multiplicative factor has been improved in (HJ for the special case of the symmetric iV-relay Gaussian parallel 
network. In this special case, the capacity has been characterized to within a multiplicative factor of 14, which is 
independent of the number of relay nodes or the maximum degree in the network. We note that this multiplicative 
factor can be readily improved for networks of small size where the exact cut-set bound can be computed. 

The low SNR regime can also be studied via a minimum energy-per-bit formulation. The minimum energy-per-bit 
in the relay channel (i.e. single relay network with a direct signal path from source to destination) has been studied 
in ifTTll . This work establishes a relationship between the minimum energy-per-bit and capacity, and characterizes 
the minimum energy-per-bit within a constant factor. 

The BAF communication scheme is not known to achieve capacity in relay channels with constant gains. However, 
it has been shown in Ifl2l that in the slow-fading Gaussian relay channel at low SNR, BAF achieves the outage 
capacity for low outage probability. 

A communication scheme denoted rematch-and-forward (RM) has been proposed in [5] for cases where there is a 
bandwidth mismatch between the source-relay and relay-destination links. In addition, RM has been shown to offer 
performance benefits even in cases where there is no bandwidth mismatch as long as the SNR is sufficiently high. 
Furthermore, RM has been studied in the context of half-duplex diamond relay networks that allow communication 
between relays 0. 

B. Notation 

Regarding notation, we use lowercase letters to denote scalars, uppercase letters to denote random variables, 
boldface letters to denote vectors, and calligraphic uppercase letters to denote sets. For example, a is a constant 
scalar, V is a random vector, and A4 is a set. A c and \A\ denote the complement and the cardinality of the set A. 
An n-vector (x\, . . . , x n ) is written as x n , and its t th element is expressed as x[t]. Differential entropy and mutual 
information are denoted by h and I and are expressed in bits. In is used for natural logarithm while log 2 is used 
for base-2 logarithm. Probability and expectation are denoted by Pr and IE. A Gaussian distribution with mean fj, 
and variance a 2 is denoted by J\f(/j,,a 2 ). We use the notation a n = 2 n ^ b±tS> to express that |i loga n — b\ < e. 1{-} 
is the indicator function. All the rates are presented in bits. 

II. Communication model 

We consider a parallel relay network consisting of a source node S, N relay nodes Relay i, i = 1, . . . , N, and 
a destination node D (cf. Figure [TJ. We focus on the memoryless discrete-time Gaussian version of the model 
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where the source and relays transmit real signals subject to given power constraints, and the received signals are 
attenuated by real channel gains and corrupted by independent additive white Gaussian noise. 

We denote by Xs[t] and Xi[t] the signals transmitted by the source and relays i = 1, . . . ,N at discrete time t 
respectively. We denote by ^fgi G M+ the channel gain from the source to relay i and by y/hj, G R+ the channel 
gain from relay % to the destination, noting that there is no loss of generality in considering non-negative channel 
gains. In addition, without loss of generality (WLOG), we assume g\ ^ gi ^ . . . ^ g^. The received signal at 
relay i, Yi, and the received signal at the destination, Yd, are given by 

Yi[t] = y/g-iXslt] + Zi[t] fori = l,...,N, (1) 

N 

Y D [t] = Y J Vh l X l [t] + Z D [t\, (2) 

i=l 

where Z\[t], Z^\t\ and Zo[t] are i.i.d. random variables with distribution N(0,Nq). 

The source wants to communicate to the destination a random message W that is uniformly distributed in the set 
{1,2,..., M}. A (2 nR , n) code for the parallel relay network consists of a set of integers M. = {1, 2, . . . , 2 nR } with 
[2 nR \ = M, called the message set and the following encoding and decoding functions. The source uses an encoding 
function Enc5 : {1, 2, . . . , M} — > M n to map any message w G M. into a vector (x5 )W [l], . . . , xs,w[n]) = Encs(u;). 
At time t relay i maps the input signals • • • , Ui[t — 1]} into an output signal Xi[t] using an encoding function 

Enc* -1 : IR* -1 — > R for i = 1, . . . , N and t = 1, . . . , n. The decoder outputs an estimate of the transmitted message 
w = Dec(y/)[i], . . . , yr> [n]) using the decoding function Dec : W 1 —^{1,2,..., M}. 

We assume the average power of the transmitted signals to be upper bounded by P$ at source node and Pi at 
relay nodes i = 1, 2, . . . , N, i.e. for a block of n symbols 

1 n 

-y>s,™M 2 <Ps, forallu;e.M, (3) 

(=1 
1 - 

- > xAt} 2 < Pi for all y? el", for i = 1, . . . , N, (4) 

n — ^ 



n 
t=i 



n 
t=i 

noting that in Q Xi [t] = Enc^^y^ -1 ). 

The average probability of error is defined as the probability that the decoder's estimate W is not equal to the 
transmitted message W, and is given by 

P e n) = ^rY. Pr {Dec({Y D [i]}?=i) | w sent}, 

where W is assumed to be uniformly distributed over the elements of M.. 

The rate R is said to be achievable for the parallel relay network if there exists a sequence of (2 nR , n) codes 
with the transmitted signals satisfying the power constraints ^ and such that Pe^ tends to zero as n tends to 
infinity. 

For a code with block length n and rate R n ^ 1/n, where the rate can vary with n, the energy of codeword 
w G M is given by 

n 

and the maximum transmission energy of relay i is 

4 n) = «up (£>[t] a Y 
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The energy-per-bit for the code is given by 

N 



\ i=i / 

An energy-per-bit £ is said to be achievable if there exist a sequence of (2 njR »,n) codes with P e (n) and 
lim sup £( n ) sC £. The minimum energy-per-bit is the infimum of the set of achievable energy-per-bit values. 

In Sections III through [V] we assume that the power constraints on the source and relays are equal to one, i.e. 
Ps = Pi = 1, for i = 1, . . . , N and the noise variances at the relays and destination are equal to one, i.e. Nq = 1. 
This normalization does not reduce the generality of the results since different power constraints and noise variances 
can be absorbed in the channel gains. 



III. Achievable rates 



A. Existing communication schemes 

In this subsection we obtain the rates achievable with DF, AF, and BAF. In DF each relay must decode the message 
transmitted by the source node. As a result, the achievable rate cannot exceed the capacity of the point-to-point 
channel from the source to each relay i. Therefore, 



#df s$ -log 2 (l + 5i ), i = l,...,N. 



(5) 



In addition, since all the relays decode the transmitted message, they can beamform their transmissions to the 
destination. The achievable rate cannot exceed the capacity of the multiple-input single-output (MISO) point-to- 
point channel from the relays to the destination, obtaining 

\ 2 " 

i + I > \fhi 



1 



-Rdf ^ - log 2 



Combining (|5]> and Q we obtain the achievable rate with the DF communication scheme 

1 



(6) 



R 



DF 



log 2 



1 + min < mm a , 

l<i<N 




(7) 



In AF, each relay amplifies its received signal by an amplification factor y 7 /^. In order not to exceed the transmit 
power constraint, the amplification factor must satisfy K{ < 1/(1 +gi). The optimal values of {Ki}f =l are obtained 
through optimization. AF also achieves a beamforming gain from the relays to the destination, but in contrast to DF, 
the received noise at the relays is amplified and added to the overall received noise at the destination. Taking into 
account the amplification factors and the additional noise appearing at the destination, we obtain the following 
achievable rate for the AF scheme 



i?AF = SUp ~ log 2 

0<« i <l/(l+ 9i ): 1 
1=1,..., N 



1 + 



Kihi + 1 



(8) 



Finally, in BAF the source and relays transmit only during a fraction 5 of the time, remaining silent during the 
remaining channel uses. This allows to increase the transmission power at the source and relays by a factor 1/5 while 
still satisfying the average power constraint. The amplification factor at relay i, Ki, must satisfy Ki < 1/(5 + g-i), 
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with the best performance obtained by optimizing the rate over the feasible values of {K{\f =1 . Since the channel 
is only used a fraction 5 of time, which can also be optimized, the resulting achievable rate of the BAF scheme is 

2" 



i?BAF = SUp - 

0</e<<l/(5+fl 4 ):i=l,...,iV 



log 2 



1 + 



(9) 



B. The Superposition-Partially-Decode-and-Forward communication scheme 

Sending correlated signals at the relays achieves a beamforming gain in the links to the destination. This corre- 
lation can be achieved if the relays decode the source message and re-encode it using the same codebook (decode- 
and-forward), or if they amplify the received signals (amplify-and-forward). As mentioned in the introduction, AF 
leads to poor performance at low SNR due to noise amplification. The use of the AF scheme in bursts (bursty- 
amplify-and-forward) allows to increase the effective symbol SNR while satisfying the power constraints at the 
source and relays. 

The SPDF communication scheme that we propose is a generalization of DF, AF, and BAF. In the BAF scheme, 
the bursts occur in predetermined time intervals, and no information is carried in the timing of the bursts. A 
burst pattern can be interpreted as a binary {0, 1} sequence that has ones in the time positions where the source 
uses positive transmission power. In BAF this binary sequence is constant. Our SPDF scheme is inspired by the 
observation that it is possible to encode information by using different binary patterns as long as the relays and 
destination can identify these patterns. The relays need to determine the binary pattern in order to know when to 
amplify the received signal and when to remain silent. 

As in BAF, we increase the effective SNR of AF by transmitting sparse or low duty cycle signals. One way of 
generating these signals is by generating a binary codebook C\ with random i.i.d. codewords ui ^ with Bernoulli^) 
symbols, and for each of these codewords ui ^ generating a random codebook C^,^ with codewords U2,i 1 ,i 2 with 
symbols distributed as A/"(0, 1/5) for the time indices where tii^Ji] = 1 and with constant value for the time 
indices where Wi^Ji] = 0. The source transmits a message indexed by (ii,^) by transmitting U2 j j lj i 2 . Then the 
relays attempt to decode the binary codeword uj,^ and amplify the received signal only in the time indices t where 
= 1' usm g appropriate gains to satisfy the power constraints. Finally the receiver attempts to decode both 
ui.j 1 and U2,j 1 ,j 2 generating an estimate (ii,^)- We call this communication scheme Binary SPDF or BSPDF. 

The following theorem characterizes the rates that can be achieved with the BSPDF communication scheme. 

Theorem 1. Assume B ~ Bernoulli(5) and X$\{B = 0} = (i.e. a mass of probability one at zero), Xg\{B = 
1} ~ A/"(0, a 2 ). The received signal at relay i = 1, . . . , N, is Y{ = y/glXs + Z%, where Z{ is i.i.d. M(0, 1). Relay 
i, i = 1, . . . , N, amplifies the received signal Y{ by a constant yJ~K~i if B is equal to one, otherwise the relay sets 
its output equal to zero. This results in Yd = 1{B = 1} X^i=i V K-ihiYi + Zr>. If 5a 2 < 1 and 5ni(gia 2 + 1) < 1, 
% = 1 , . . . , N, then the rate R = R\ + R2 is achievable if 

R\ < l(B; Yi), (10) 
R 2 <l(X s ;Y D \B), (11) 
R l + R 2 <l(B,X s ;Y D ). (12) 

Proof: The proof is given in Appendix [A] ■ 

We note that in Theorem[l] k%, . . . , k^v, a 2 and 5 are parameters that can be optimized for given channel gains 
in order to maximize the achievable rate R. 

As will be shown in Section |Vj timesharing between two BSPDF schemes with different parameters can improve 
the achievable rate. This means that it is possible to improve performance by using different average power levels 
in the non-zero parts of the transmitted codewords, and using different amplification factors at the relays for these 
different codeword parts. This suggests that the BSPDF scheme outlined above can be generalized by considering 
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more general distributions to generate C\. In addition, we can generalize the mapping that the relays use to map at 
time t the decoded symbol and received signal (ui^ [t], Ui[t\) into Xi[t], the transmitted signal. 

The following theorem presents a generalization of Theorem[T] to ternary alphabets for the symbols of the 
codewords in C\ and two different amplification factors for the relaying functions of each relay. We call the 
resulting communication scheme Ternary SPDF or TSPDF. 

Theorem 2. Consider the discrete random variable T over the alphabet {0, 1, 2} such that Pr(T = 0) = 1 — 8\ — 82, 
Pr(T = 1) = 8\ and Pr(T = 2) = 82 for 8\,82 > 0, 61 + 62 S% 1. Let the random variable X$ be such that 
X S \{T = 0} = (i.e. a mass of probability one at zero), X S \{T = 1} ~ W(0, a\) and X S \{T = 2} ~ JV(0, of ). 
The received signal at relay i = 1, . . . , N, is given by ^Jg~iX$ + Zi where Zi ~ M(0, 1) are Ltd.. Relay i amplifies 
the received signal by a constant v^il ifT = l, by a constant ■ y /Ki2 ifT = 2, and sets its output to zero ifT = 0, 
for i = 1,2, . . . , N. This results in Yd = 1{T = 1} Y^iLi \Z K nhiYi + \{T = 2} Yli=i V K i2hiYi + Zry- Given 
5\a\ + 82^2 < 1 and 8\Kii(giO~\ + 1) + <52Ki2(<7j02 + 1) < 1 for i = 1,2, ... ,N, then the rate R = R\ + R2 is 
achievable if 

Ri <I(T;Yi), (13) 
R 2 <1(X S ;Y D \T), (14) 
R 1 + R 2 <1(T,X S ;Y D ). (15) 

Proof: The proof is given in Appendix [B] ■ 

We can generalize the BSPDF and TSPDF communication schemes by using more than two message levels and 
require different relays to decode different message-level sets. For example in cases where the channel gains ^fgl 
are different we may require some relays to decode all transmitted message levels, while requiring other relays to 
decode the message at level one, and finally let other relays amplify the received signals without decoding any 
messages. 

In general, we can consider K message levels with K up to 2^ (one message level for each subset of relays). 
However, the fact that the broadcast channel from the source to the relays is degraded implies that N + 1 levels 
suffice. This is because if the message at level k is to be decoded by relay i, all relays j with j > i are also able 
to decode the message due to the assumption gj ^ for j > i. We use f(i), f : {1, . . . , N} — > {0, 1, . . . , K}, to 
denote the maximum message level that relay i can decode or zero if relay i does not decode any message level, 
noting that / is non-decreasing. In this multi-level message setting, we require the destination be able to decode 
all message levels. 

We can further generalize the BSPDF and TSPDF schemes by considering more general relaying functions. 
We denote by Wi(ui, . . . ,UfM,yi) the transfer function of relay i which at time t produces the output Xi[t] = 
Wi(ui[t], . . . ,Uf^[t],yi[t\) based on the decoded codeword symbols m,... , Ufa) and the received signal yi at time 
t. We can think of the symbols ui[t], . . . , Uff{\[t] as a directive to relay i about how to process its input m[t] in 
order to produce its output X{[t]. The multiple codebook levels allow to give different directives to the various 
relays depending on their decoding capabilities. In addition, the codewords obtained from these symbols convey 
part of the source message to the destination. 

Theorem[3] below establishes the rates that can be achieved with this more general communication scheme which 
we call SPDF. While these rates are characterized in terms of single letter bounds, the actual computation of the 
achievable rates requires appropriate choices for the joint distribution of the random variables used to generate the 
codebooks, the function /(•), and the mapping functions used at the relays, which are all design parameters. 

Theorem 3. Let 1 ^ K ^ N + 1 be some integer constant and f : {1, . . . , N} — > {0, 1, . . . , K} be a non- 
decreasing function that denotes the maximum message level that relay i can decode. In addition, let (U\, ■ ■ ■ , Uk) 
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be distributed according to some joint distribution Pu 1 ,...,u K satisfying K[U^] < 1, and define 



Vt =VmU K + Zi, fori = l,..., N, (16) 

N 

Vd=J2 VhiMUi, U f[i) ,Vi) + Z D , (17) 
i=i 

where Z\, . . . , Zn and Zd are i.i.d. Af(0, 1), and the relay transfer functions Wi : IR-fW +1 — > M. for i = 1, . . . , N 
satisfy 

E[( Wl (U u U m ,Vi}) 2 } < 1, for i = 1, . . . , N. 
Then, the rate R = Yl m =l ^ m * s achievable in the parallel relay network if 

f(i) 

} R m <l(U k , ■ ■ . ,Uf(i); Vi\U\, f7jfc-i), where i = min j, for k = 1, . . . , K, 

m=k 
K 

£ Rm <I(U k , ...,U K ;V D \U U ..., E7 fc _i), for k = l,...,K. (18) 

m=fc 

Proof: The proof is given in Appendix [C] ■ 

In order explain the role of the different variables and functions introduced in Theorem[3] we derive their 
relationship to the quantities introduced in Theorem[T] for BSPDF. BSPDF uses K = 2 codebook levels, and 
the following function /: /(l) = . . . = f(N) = 1 (i.e. all relay nodes decode the first level codeword, while the 
destination decodes the first and second level codewords). In addition, the binary auxiliary random variable B of 
Theorem[TJ becomes U\ in Theorem[3j while X$ becomes U<i- In Theorem[T] we set the relay transfer functions 
Wi(B, Yi) = 1{B = l}y/KiYi, for i = 1, . . . , N. We note that due to technical reasons in the proof of Theorem[3] 
the auxiliary random variables (U\,..., Uk) have a continuous distribution. The proof of Theorem[T] shows how 
to emulate the discrete auxiliary random variable of BSPDF with a continuous random variable. 

We can also use Theorem[3] to generalize the BSPDF scheme by allowing different relays to decode different 
message levels. The function /, as before, specifies the maximum message level that relay i can decode or zero if 
it does not decode any message level. We use the notation BSPDF(/) to make the dependence on / explicit. 

Corollary 1 (BSPDF(f)). Assume B ~ Bernoulli{5) and X$\{B = 0} = (i.e. a mass of probability one at 
zero), Xs\{B = 1} ~ A/"(0, a 2 ). The received signal at relay i = 1, . . . , N, is Yi = y/glXs + Z\, where Z% is 
i.i.d. M(0, 1). Relays i with f(i) = amplify the received signals Yi by constants ■s/kI, relays i with f(i) = 1 
amplify the received signals Yi by constants JWi if B is equal to one, otherwise the relays set their output equal 
to zero, and relays i with f(i) = 2 retransmit the source message X$- This results in Yrj = Y2i-f(i)=o V K ihiYi + 



YL%:HCj=\ = l}V K ihiYi + J2i-.f(i)=2 VhiXs + Z D . If 5a 2 < 1, Ki(bgiU 2 + 1) < \, for all i such that f(i) = 0, 
and 5ni(gia 2 + 1) < 1, for all i such that f(i) = 1, then the rate R = R\ + R2 is achievable if 

Ri<l(B;Yi), where i= min j, (19) 

Ki<AT:/(i)=l 

R 2 < l(X s ;Yi\B), where i= min j, (20) 

R 2 <1(X S ;Y D \B), (21) 
R 1 +R 2 <1(B,X S ;Y D ). (22) 

Proof: The proof follows along the same lines of the proof of Theorem[T]by specifying proper auxiliary random 
variables and relay transfer functions in Theorem[3] The details are omitted. ■ 



It will be shown in Section V-C that this BSPDF generalization provides rate improvements in asymmetric 
channels. 
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C. Timesharing different schemes 

It is possible to timeshare different communication schemes, splitting the available time and power among them. 
As will be seen in Section [Vj for some channels timesharing various schemes achieves larger rates than those 
achievable with any of the constituent schemes alone. In this subsection we characterize the rates achievable with 
timesharing. 

Let Ri(Ps, Pi, ■ . ■ , Pjv)j denote the maximum achievable rate with communication scheme i under average 
power constraints at the source and relays Pg,Pi,- ■ ■ ,Pn respectively. Define the achievable rate-power region 
with communication scheme i by TZi, i.e. 

Ki = {(n,ps,pi, ■ ■ ■ ,Pn) G : n ^ Ri(p s ,pi, . . . ,Pn)}- 

Timesharing various schemes indexed by i G X we can achieve rate-power vectors in 1Z = Convex Hull(Uj g x7tj). 
In the channel with unit power constraints at the source and relays we can achieve rate-power vectors in 1Z = 
{(r,p s ,pi, . . ■ ,Pn) 6 TZ : ps < I, Pi < 1, • • • ,Pn < !}■ 

For all the schemes that we consider in this paper Ri(Ps, Pi, . . . , Pjv) is a continuous function. Since in 
addition Ri(0, . . . , 0) = for any communication scheme, we have that Uj S i72.j is a connected set. Using 
Caratheodory's theorem iflOl and its extension we have that any point (r,p s , pi, . . . ,Pn) £ ^ can be written 

as Y,f=i ^j( r j'Ps,j>Pi,ji ■ ■ ■ >PN,j) with Xj ^ 0, Ylj=i = 1 and ( r j,Ps,j,Pi,j, ■ ■ ■ ,PN,j) G Ki for some i, and 
for j = 1, . . . , N + 2. This means that it is enough to timeshare N + 2 pure schemes to achieve the timesharing 
region. 

Furthermore, in some special cases we can reduce the number of pure schemes to timeshare below N + 2. For 
example, in a symmetric parallel relay network we may restrict attention to communication schemes that assign 
the same power to all relays. In this case, we can reduce the dimension of the space where all the sets above (i.e. 
TZi,lZ,7Z) are defined to three, and timesharing three pure schemes suffices to achieve the timesharing region. 

The timesharing results of Section [V] are obtained by using the above properties. 

IV. Cut-set upper bound 

The best known upper bound for the parallel relay network is based on the so-called cut-set upper bound. For 
a general network with node set M a cut Q is any subset of M. Let Yi and Xi be the input and output variables 
associated with node i, and Rij be the transmission rate from node i to node j. The following proposition gives 
the cut-set capacity upper bound for the network [7, Theorem 15.10.1]. 

Proposition 1. If the information rates {Rij} are achievable, there exists some joint probability distribution 
p(X\, . . . ,X\j^\) (or probability density function in case of continuous random variables) such that 

J2 %<I(Xn;Y n c|Xne) 

for all ncj\f. 

For a 2-relay diamond network there are four cuts to consider, as shown in Figure [2] Specializing Proposition[T] 
to this network we obtain the following theorem (stated without proof). 

Proposition 2. The capacity C of the Gaussian 2-relay diamond network is upper bounded by 

C ^ max min {C (s) , C( SRl) ,C\ SR2) ,C iD) } , 
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Relay 2 



Fig. 2. Various cuts in a 2-relay diamond network. 



where 



C (S) =\ l °S2 (1+51 +92) , 
C {SRl) =\ log 2 [(1 + S2)(l + 7*1(1 - p 2 ))] 
C(5K 2 ) = \ log 2 [(1 + 5i)(l + h 2 {l- p 2 ))} 



C(D) =\ log 2 (l + hi + h 2 + 2p\Jh\h<< 



2 
1 

2 

While in principle Proposition[2]can be extended to networks with any number of relays, N, the computation of 
the cut-set bound becomes increasingly complex as N increases. For symmetric networks where g± = g% = . . . = 
qn = 9 and h\ = h 2 = . . . = = h, an alternative upper bound is provided in ||8] as stated below. 

Proposition 3. (K8\ Lemma 6]) For any N ^ 2, g,h > 0, the capacity of the Gaussian N -relay parallel network 
is upper hounded by 

C(N, g, h) ^ sup min (\ log 2 (1 + (N- n)g) + \ log 2 (l + n (l+(n- l)p - "ff'^^ l ^ ) • 



V. Analysis of SPDF in the asymptotic low SNR regime 

Theorems [T] and [2] provide explicit single-letter expressions of achievable rates for BSPDF and TSPDF. However, 
finding the maximum communication rates achievable with these schemes requires optimizing some parameters and 
computing differential entropies of mixtures of Gaussian random variables. 

To simplify the numerical computations and to gain more insight into the low SNR regime we consider an 
asymptotic regime where the channel gains tend to zero while maintaining a constant ratio among them. Since in 
this regime the channel capacity goes to zero, we measure performance by the ratio of the achievable rates and 
some reference channel gain. We consider two types of parallel relay networks: 

• N-relay symmetric: N ^ 2, gi = g 2 = ■ ■ ■ = gjv = fl» hi = h% = . . . = = h. We let h,g — > while 
keeping the ratio g/h constant. We compare the performance of the different schemes by computing R/g. 

• 2-relay asymmetric: N = 2, g\ = h 2 = g, hi = g 2 = h. As before, we let h,g — > while keeping the ratio 
g/h constant. We compare the performance of the different schemes by computing R/y/gh. 

We note that we could have used as performance metric R/x for any x that goes to zero at the same rate as g and 
h. The choice of R/y/gh as performance metric in the 2-relay asymmetric network results in equal performance 
for h/g = a and h/g = 1/a. This symmetry allows to present the results in a more compact form. 
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The benefit of considering the asymptotic low SNR regime is that we can approximate the differential entropy 
of a mixture of Gaussian distributions with an explicit expression using Taylor series, as shown in the following 
lemma. 

Lemma 1. Let the probability distribution of the random variable Y be a mixture of q Gaussian distributions 

q-1 q-1 

Mv) = (1 - E *) e- y2/{2a ° } /(V2Tr<T ) + J> /(V^;), 

i=l i=l 

where oq > 0, c ? ; > 0, 8i > 0, for i = 1, 2, . . . , q — 1 and Ya=i 8% < !• Letting 5 = (5i, 82, ■ ■ ■ , 8q-i), for small 
|| 8 1| 2 we can wn'te f/ie differential entropy ofY as 

1 9-1 
hOn = g log 2 (2vrea 2 ) + 5>(<x?/<x 2 ~ l)/(21n2) + 0(||5|||). 

i=l 

Proof: The proof is given in the Appendix [D] ■ 

A direct application of Lemma[T] results in the following corollary, which gives approximations for two mutual 
information expressions. We use these approximations to compute the performance of binary and ternary SPDF 
schemes. 

Corollary 2. Consider the q-ary random variable Q with the distribution Pr {Q = i} = 8i, for i = 0, 1, . . . q — 1, 
where Yli=o ^ = !• We set the distribution of the random variable W to be W\{Q = i} ~ A/"(0, of), /or 
« = 0, 1, . . . ,q— 1 we define Y = W + Z where Z is statistically independent of Q and W and has distribution 
M(0, <t|). Letting 8 = (61,62, ■■• ,8 q -i), then for a small \\6W2 we have 

9- 1 A. / „1 ' 



i(w ; y|Q)=^|io g2 (1 + ^) 

i=l ^ 



A. Symmetric diamond network with two relays 



In this subsection we consider a network with N = 2 relays, and symmetric gains of the form gi = 92 = 9 and 

hi = h<z = h. 

The following theorem provides an asymptotic characterization of the maximum rate achievable with the Binary 
SPDF scheme. 

Theorem 4. For a symmetric diamond relay network assume that the channel gains from the source to relays are 
equal to ^fg and channel gains from relays to the destination are equal to \fh. The rate h ■ RBSPDF(g/h) + 0(h?) 
bits per network use is achievable, where R.BSPDF(g/h) is the solution of the following optimization problem. 



R, 



BSPDF 



(g/h) 




g/h- Py/gjhln 
2(2^g~7h + (3) 



1 + 



fgJh + P 



*2<^?1M1 + 



g/h 



1 + 



2(2^g~Jh + P) 
P(g/h + Py/g~/h) 



21n2 



(3(2 + g/h + pJg/h) 
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Fig. 3. The upper bound and achievable rates of decode-and-forward (DF), bursty-amplify-and-forward (BAF), and binary-superposition- 
partially-decode-and-forward (BSPDF) communication schemes in the symmetric diamond relay network, ^fg is the channel gain from the 
source to relays and yh is the channel gain from relays to the destination. 



Theorem[6]in the next subsection extends Theorem 4] for iV-relay symmetric parallel networks. We omit the proof 
of Theorem[4] since it is a special case of Theorem[6 

We have solved the optimization for -Rbspdf^/^) numerically for a range of g/h. The results are plotted in 
Figure [3] together with the achievable rates of DF, BAF, and the cut-set upper bound. Also plotted in the figure 
are the rates achievable by timesharing DF with BAF, and by timesharing two BSPDF schemes with different 



parameters (refer to Section III-C for an explanation of the rates achievable by timesharing different schemes). 
The result of timesharing between DF and BAF in Figure [3] appears as a segment of a line that starts at the point 
h/g = 1/4 and rate/g = 1/(2 In 2) and is tangent to the curve of the achievable rate of BAF [2J. Similarly, the 
timesharing of two BSPDF schemes of different parameters results in a segment of a line that starts at the same 
point h/g = 1/4 and rate/g = 1/(2 In 2) and it is tangent to the curve of the achievable rate of BSPDF. 

It follows from Figure|3]that BSPDF outperforms both DF and BAF in the low SNR regime. It also follows from 
the figure that for certain range of h/g timesharing different BSPDF schemes results in higher rates. It is interesting 
to note that for h/g ^ 1/4 the achievable rates of DF and BSPDF match the cut-set upper bound. Therefore, both 
of these schemes are optimal in the asymptotic low SNR regime for the range ^ h/g ^ 1/4. 

We show in Figure [4] the contribution of the binary codebook rate R± and the Gaussian codebook rate R2 to the 
total communication rate R = R\ + R2 in the BSPDF scheme. We observe that for small values of h/g when DF 
is optimal, BSPDF performs full decoding at the relays by assigning zero rate to the Gaussian codebook. As h/g 
increases, it becomes beneficial to start conveying some rate in the Gaussian codebook. For h/g S> 1 most of the 
rate is conveyed by the Gaussian codebook. 

The fact that timesharing BSPDF schemes with different power levels can result in a scheme with higher 
achievable rate suggests that a more flexible coding scheme that uses more than two power levels to select the 
codeword symbols in the C2.i1 codebook, and that uses more amplification factors at the relays, should outperform 
the BSPDF scheme. 

To that effect we consider the Ternary SPDF scheme as described in Theorem[2] and characterize its performance 
in the asymptotic low SNR regime in the following theorem. 



Theorem 5. For a symmetric diamond network with two relays assume that the channel gains from the source to 
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relays are equal to ^fg and channel gains from relays to the destination are equal to \fh. The rate h ■ Rtspdf{9/ h) + 
0{h 2 ) is achievable where RrsPDFig/h) is the solution to the following optimization 



R 



TSPDF 



(9/h) 



sup 

3i,/32,7i,72,«i,«2 



R1+R2 



subject to: 

Ri < 

Ri < 
R 2 < 



2 In 2 
2 In 2 



fa hig/h - In [1 + -ng/h]) + fa {-129 /h - In [1 + ~/ 2 g/h}) ) 

fa ((2 + 4rng/h - In [1 + (2 + A ll9 /h) k x \) k x ) + 
fa ((2 + Arfyg/h) k 2 - In [1 + (2 + A l2 g/h) k 2 }) 



In 



1 + 



4 (7iff/ft-W 
1 + 2ki 



01 + In 



1 + 



4(72fl/ 'h)n 2 
1 + 2k 2 



fa 



2 In 2 

0<fa,fa 

^ 7i,72,«i,« 2 

1 > v 7 ^ (A (1 + 7i#M) «l + fa (1 + WO K2) 
1 > sfgjh (0i7l + fal2) ■ 

Proof: The proof is given in Appendix [E] ■ 

Solving the optimization problem of Theorem[5] is difficult. We have been able to find local optima for various 
values of h/g which are plotted in Figure [5] Notice that any local optimum in the optimization of Theorem[5] also 
results in an achievable rate for the diamond relay network. These numerical results show that TSPDF outperforms 
timeshared BSPDF and other known relaying schemes for symmetric diamond networks with two relays in the 
asymptotic low SNR regime. 
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Fig. 5. The upper bound and achievable rates for decode-and-forward (DF), bursty-amplify-and-forward (BAF), Binary SPDF (BSPDF) 
and Ternary SPDF (TSPDF) communication schemes in the symmetric diamond network with two relays. Jg is the channel gain from the 
source to relays and %/h is the channel gain from relays to the destination. The inset shows an expanded view comparing the achievable 
rates of TSPDF to timeshared BSPDF. 



B. Symmetric parallel network with N relays 



So far we have only considered diamond networks with two relays. In this section we consider a symmetric 
parallel relay network with N > 2 relays. We assume a symmetric network where the channel gains from source to 
relays are equal to ^fg and the channel gains from relays to destination are equal to \fh. Furthermore, we assume 
that g, h <C 1 with a fixed ratio g/h. 

In the following theorem we characterize the achievable rate of the Binary SPDF scheme in the asymptotic low 
SNR regime. We use Theorem[T] and approximate the mutual information expressions for small g and h using 
Corollary|2] 

Theorem 6. The rate h-RBSPz>p(N,g/h) + 0{h?) is achievable in the symmetric N -relay Gaussian parallel network, 
where Jg is the channel gain from the source to relays and \fh is the channel gain from relays to the destination 
and Rbspdf{N , g / h) is the solution of the following optimization 



R. 



BSPDF 



(N,g/h) 



sup [Ri + R 2 

/3>0 V 

subject to: 
Ri < ' ' 



2 In 2 



g/h 



i + 



ft 



Ri < 



R 2 < 



1 



21n2 



2 In 2 



'N(N^g~Jh + ft) 

Vg/h + ft 



i + 



NjN^/gJh + ft) 
ft(g/h + ft^g~/h) 



In 



1 + 



ft (N + g/h + ft^g/h 



Proof: The proof of the theorem is given in Appendix [F] 
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Fig. 6. Cut-set upper bound and achievable rates of the bursty-amplify-and-forward (BAF) and the Binary SPDF (BSPDF) communication 
schemes in the symmetric Gaussian parallel relay network with N = 2, 4 and 8 relays, y/g is the channel gain from the source to relays 
and yh is the channel gain from relays to the destination. 



We numerically computed the achievable rates of BAF and BSPDF, and the results are given in Figure [6] for 
networks with N = 2, 4, 8 relays. Figure [6] also shows the capacity upper bound of Proposition[3]for these networks. 
We see in this figure that as the number of relays increases, the gap between the rates of BSPDF and BAF shrinks. 
As N increases for a fixed h/g ratio, the links between the relays and the destination benefit as a whole from a 
larger beamforming gain while the links from the source to each relay remain invariant. Exploiting the benefits of 
larger N requires increasing the data rate R = R\ + R2, which makes it harder for the relays to decode the lower 
level codewords ui ^. Successful decoding of the lower level messages requires in turn to reduce Ri. In addition, 
BAF can be thought of as a special case of BSPDF with Ri = 0. As a result, as N increases the performance of 
BSPDF approaches that of BAF. 



C. Asymmetric diamond network with two relays 

In the previous subsections we considered symmetric parallel relay networks where g\ = ■ ■ ■ = gN and h\ = 
. . . = h]\f- These symmetric networks model scenarios where all relays are at similar distances to the source and 
destination. In this section we consider a diamond relay network with N = 2 relays, where one of the relays is 
close to the source, while the other is close to the destination. To simplify the analysis we assume g\ = hi = g 
and g% = h\ = h, and as before, consider the regime of g, h — > with a fixed ratio h/g (see Figure [7]). WLOG we 
assume g ^ h. 

As we did in the previous subsections, we analyze the achievable rates with the Binary SPDF communication 
scheme in the low SNR limit for this network, and compare them to the rates that can be obtained with DF and BAF. 
In the asymmetric network setting we consider the extension of the BSPDF scheme given in Corollary [TJ where 
different relays decode different message levels. The function f(i) specifies the maximum message level decoded by 
relay i. We first consider the case where both relays decode only the first message layer (/(l) = /(2) = 1). Using 
Theorem[TJor specializing Coroilary[TJ to this choice of /, and approximating the mutual information expressions 
for small g and h using Corollary|2] we obtain the following result. 

Theorem 7. WLOG we assume g ^ h. The BSPDF(f(l f(2)=l) communication scheme in the asymmetric 
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2-relay diamond network achieves a rate h ■ Rbspdf(x,i)(9 I f h) + 0{h?), where 



RiiSPDF(i,i)(g/h) = sup lRi + R 2 

Kl,K2,j3 ^ 

subject to: 



g/h+ yjg~jh(3\xi 




P 



1 + ki + K 2 g/h + 
\fgJh{y/Ki- 



g/h {y/nj + 2 

2\ 



+ 



2 In 2 



1 + 



P 

g/h (y/Ki + JKi 



P(1 + k 1 + K 2 g/h) 

Ki(g/h + Py/g/h)<l, k 2 (1 + Py/g/h) < 1 
«i > 0,k 2 ^ 0,/3 > 0. 

Proof: The proof of the theorem is given in Appendix [G] ■ 

Due to the asymmetry in the network, it may be beneficial to consider a variation of the BSPDF scheme in 
which the relay with larger source-relay channel gain attempts to completely decode the source messages. This 
corresponds to choosing /(l) = l,/(2) = 2. As a baseline for comparison we consider a communication scheme 
based on BAF, which we call BAF+DF where the relay with larger source-relay channel gain decodes and forwards 
the source message (as in DF) while the other relay amplifies the received signal during the burst interval (as in 
BAF). The following proposition characterizes the achievable rates with BAF+DF. 

Proposition 4. WLOG we assume g ^ h. The BAF+DF relaying scheme can achieve a rate h ■ Rbaf+df(9 / h) + 
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0(h 2 ), where 



R B AF+DF{g/h) = sup R 
subject to: 



2 In 2 
2 In 2 



1 + 



1 + 



PVg/h_ 

g/h(l + ^K) 2 



B>0, k^O, K(f3yJg~Jh + g/h) < 1. 

We now consider the BSPDF(/(1) = l,/(2) = 2) scheme. Using Corollary[T] and approximating the mutual 
information expressions in the low SNR limit with Corollary|2] we we obtain the following result. 

Theorem 8. WLOG we assume g ^ h. The BSPDF(f(l) = l,/(2) = 2) relaying scheme can achieve a rate 
h ■ RBSPDF(i,2)(g/h) + 0(h 2 ), where 



R 



BSPDF(1,2) 



{g/h) 



(#1+^2 

subject to: 



sup 




g/h + p^fgjhln 



8 



(g/h)(l + y/it) 2 + k- By/g/hhi 



1 + K + 



p 



2 In 2 



2 In 2 



1 + 



1 + 



PVg/h_ 

(g/h)(l + ^) 2 



B > 0, k > 0, K(p^g 7 /h + g/h) < 1. 

Proof: The proof of the theorem is given in Appendix [H] ■ 

We numerically computed the achievable rates of DF, BAF, BSPDF(/(1) = l,/(2) = 1), BAF+DF, and 
BSPDF(/(1) = l,/(2) = 2) as a function of h/g and the results are given in Figure [8] Also shown in Figure [8] 
is the achievable rate resulting from timesharing BSPDF with itself for h/g = 1, which can be obtained from the 
BSPDF timesharing curve of Figure [3] for the symmetric diamond relay network with N = 2. We see that for 
h/g = 1 and possibly for other values of h/g (not shown), timesharing BSPDF achieves a larger rate than any of 
the pure schemes. 

Figure g shows that BSPDF(/(1) = l,/(2) = 1), BSPDF(/(1) = l,/(2) = 2) and timeshared BSPDF(/(1) = 
l,/(2) = 1) provide the largest achievable rates of all the schemes considered. 

One may possibly obtain larger rates by increasing the alphabet size of the first level codebook as was done in 
Theorem[5] for the symmetric diamond relay network. The analysis should be similar to the one in Theorem[5] and 
we do not consider it in this paper. 



VI. Bounds on the Minimum Energy-Per-Bit 



In this section, we use the results of Sections IlTfV to compute upper and lower bounds on the minimum energy- 
per-bit for the symmetric diamond relay network. An extension of Theorem 1 in ifTTTl shows that the minimum 
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Fig. 8. Upper bound and achievable rates for various communications schemes in the asymmetric Gaussian diamond network. The schemes 
considered are decode-and-forward (DF), bursty-amplify-and-forward (BAF), bursty-amplify-and forward at relay 1 and decode-and-forward 
at relay 2 (BAF+DF), Binary SPDF (BSPDF(/(1) = 1, /(2) = 1)), and the extension of BSPDF where relay 2 performs decode and 
forward by decoding both level codewords (BSPDF(/(1) = l,/(2) = 2)). The communication model is given in Figure [7] Timesharing 
two BSPDF schemes of different parameters achieves better performance as depicted for the case h — g. The timesharing point is estimated 
from Figure [3] 



energy-per-bit of the diamond relay network can be characterized as 

P ■ t ■ f (1+71+72)^ 

£ b = mf mf — — — , (23) 

7i,72>OP>0 C(P,7iP,7 2 P) 

where C(P, 71 P, 72-P) represents the capacity of the diamond relay network with average power constraint P, 71 P 
and 72 P on the source, relay 1 and relay 2 nodes respectively. For a symmetric diamond relay network, we know 
that by symmetry C(P, 71 P, 72P) = C(P, 72 P, 71 P). Thus, by timesharing we have 

C(P, (71 + 72)P/2, (71 + 72)P/2) > C(P, 71 P, 72P). 
So for any 71, 72 > if we set 7 = (71 + 72)/2 we can write 

(l + 7i+72)P > (l + 7i + 72)P _ (1 + 2 7 )P 



C(P, 71 P, 72P) " C(P, (71 + 72)P/2, (71 + 72)P/2) C(P, 7 P, 7P) ' 
and the minimum energy-per-bit for the symmetric diamond relay network becomes 

£ t =infinf j 1 + 27)P v (24) 
b 7 >op>oC(P,7P,7P) v ' 

The capacity function C (P, 7P, 7P) satisfies the conditions of Lemma 1 in [11] and therefore g(p^p^p) is a 
non-decreasing function in P, for all P > and 7 > 0. This implies that we can replace infp>o by limp^o i n 



(24 1 obtaining 

£ b = inf lim f + 27 ^ P n . (25) 
6 7>op^oC(P, 7 P,7P) 



A lower bound on £ b can be obtained by upper bounding C(P, 7P, 7P) in (25 1 using the cut-set upper bound 



of Proposition|2] Since in Section IV we have assumed the average power constrains on source and relays are one 



and variances of the additive Gaussian noise at relays and destination are one, we need to scale the channel gains 
appropriately in Proposition^ We set the normalized channel gains to be g = Pg/No and h = 7P/1/./V0, i.e. we 
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assume y/g as the channel gain from source to relays and V h as the channel gain from relays to destination in 
Proposition^ obtaining 

r (g+2 y° ln2 ;Vs<i/2 
^ ^ Sft (2 ^ )P = ^ ;1/2 < v ^ 2 • (26) 

1 (w f h ' ;2<V5 

A detailed derivation of this lower bound is given in Appendix [TJ 

In addition, we can find an upper bound on the minimum energy-per-bit of the diamond relay network by 



lower bounding the capacity in (25 1 with an achievable rate of any communication scheme. Since in Section III and 
Section|V]we have also assumed that the average power constraints at the source and relays are one and the variance 
of the additive Gaussian noise at relays and destination is one, we use the same normalization for the channel gains 
to compute the achievable rates, i.e. we set the normalized channel gains to g = Pg/N^ and h = jPh/No. 

If we use DF the achievable rate is given by ([7]), and p5] l results in 

S„ < inf Urn ^ + 1 > P = (" + 2 "f°'° 2 . (27) 

7>o P->o R DF gh 



If we use the achievable rate of BAF given in ( 25 1 we get the following upper bound on the minimum energy-per-bit 
of the symmetric diamond relay network 

a < W Si = - W . 2(1 r +27>JV °'"X T . (28) 



7>0P^0 i? B AF ,9>0,7>0 p^g~h\ n 



Using the achievable rate of the BSPDF scheme we can upper bound the minimum energy-per-bit of the symmetric 
diamond relay network by 

(27 + 1)JV 
7>0 jhR BSPDF (g/(jh)) 

We have plotted the ratio of the upper bound to the lower bound on the minimum energy-per-bit of the symmetric 
diamond relay network for different ratios of h/g in Figure [9] Note that the ratio between the upper and lower 
bounds on £& depends on (g, h) through h/g. 

For BAF, the ratio of the upper bound to the lower bound on Sf, goes to one if the channel gains from relays to 
the destination are much larger or much smaller than the channel gains from the source to relays. The worst-case 
ratio of the upper bound to the lower bound on the minimum energy-per-bit for bursty-amplify-and-forward is 
approximately 2.85. 

In the DF communication scheme the upper bound and the lower bound for minimum energy-per-bit become 
identical when h/g ^ 1/2; however, in the worst case the ratio of the upper bound to the lower bound on the 
minimum energy-per-bit approaches 2 when h/g — > oo. 

Combining the achievable rates of BAF and DF results in a worst-case ratio of 1.94 between the upper and 
lower bounds on £&. Thus, using previously known communication schemes, the minimum energy-per-bit for the 
symmetric diamond network can be characterized within a factor of 1.94. 

The BSPDF communication scheme outperforms both the BAF and DF schemes with respect to the minimum 
energy-per-bit metric for any values of h/g. The worst-case ratio of the upper bound to the lower bound on the 
minimum energy-per-bit of the BSPDF scheme is 1.87 and if h/g ^ 1/2, as in DF, the ratio becomes equal to one. 
Thus, BSPDF allows to tighten the minimum energy-per-bit characterization for the symmetric diamond network 
to within a factor of 1.87. 
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VII. Conclusion 

We presented a new communication scheme, SPDF, for Gaussian iV-relay parallel networks based on superposition 
coding and partial decoding at the relays. Superposition coding allows to construct codebooks of different rates 
targeted to the decoding capabilities of the various relays. Partially decoding the source message at the relays allows 
to reduce noise amplification while relaying the message. The proposed scheme is a generalization of DF, AF, and 
BAF, and achieves rates at least as large as any of these schemes. 

We have shown that by using low duty cycle codebooks and two codebook layers it is is possible to obtain 
strictly better performance than both DF and BAF in the low SNR regime. While most of the rate gains are readily 
obtained with the BSPDF scheme that uses binary symbols in the codewords of the first level codebook, some 
additional rate improvements results from either timesharing BSPDF schemes or increasing the alphabet size of the 
first level codebooks (e.g. TSPDF). 

We presented a characterization of the achievable rates of BPSDF, TSPDF, and the general SPDF scheme in terms 
of single letter expressions. These expressions often require the computation of differential entropies of mixtures 
of Gaussian random variables, and as a result, do not lead in general to closed form solutions. For the low SNR 
asymptotic regime we provided closed form formulas for the achievable rates of BSPDF and TSPDF, which for 
ease of presentation, we derived for channels with some symmetry properties. 

Computation of the maximum achievable rates, even in the low SNR asymptotic regime, requires optimization 
over some design parameters. We evaluated the performance of BSPDF and TSPDF by numerically solving the 
corresponding optimization problems, and compared their performance to that of previously known schemes. We 
note that the requirement of optimizing over some design parameters is not unique to BSPDF or TSPDF, since the 
maximum achievable rate of BAF also requires solving an optimization problem which lacks a known closed form 
solution. 

We showed that BSPDF and TSPDF achieve strictly larger rates than the previously known schemes in the low 
SNR regime for various types of parallel relay channels with some symmetry properties. The symmetry properties 
of these channels allowed us to present results for a wide range of channel gains in two-dimensional plots. We 
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observed that as the number of relays N grows, the performance gains of BSPDF over BAF decrease. 

Finally we showed how our achievable rate characterization for BSPDF in the low SNR asymptotic limit can 
be used to obtain a minimum energy-per-bit upper bound. We used this new bound to obtain a characterization of 
the minimum energy-per-bit for the symmetric Gaussian diamond relay network within a constant multiplicative 
factor of 1.87, improving over a factor of 1.94 that results from using previously known communication schemes 
to derive the minimum energy-per-bit upper bound. 

Since our analysis focused on the low SNR regime, our results do not allow us to improve the capacity 
characterizations within a constant factor or within a constant gap of Gaussian parallel relay networks. However, 
it may be possible to derive achievable rate lower bounds for BSPDF, TSPDF, or SPDF, for non-asymptotic SNR 
which may result in tighter than the currently known capacity characterizations within a constant multiplicative 
factor or additive gap. 



Appendix A 
Proof of TheoremQ] 

Theorem[3] requires (Ui, . . . , Uk) to be continuous random variables. We will use a continuous distribution for 
(J/i, Z7a) that emulates the discrete distribution for B and the mixed distribution for X$\B given in the statement 
of the theorem. 

In Theorem|3j let K = 2. We set Ux ~ Uniform[0, 1], U 2 \{Ui e [0, 1 - 8}} ~ Af(0, e) for some small e > and 
U 2 \{Ui E (1 - 8, 1]} ~ Af(0, a 2 ). Also let /(l) = . . . = f(N) = 1, and define 

if Ui € [0, 1 - 5} 

^KiVi if U x € (1 - 8, 1] 
fori = 1,...,N. 

Assuming 5a 2 + (1 — <5)e < 1 and 5ni(gicr 2 + !)<!, then by Theorem^ the rate R = R\ + R 2 is achievable if 



Wi(Ux,Vi) 



If we can show that 



Rx <l(Ux;Vx), 
R 2 <l(U 2 ;V D \Ux), 
Rx + R 2 <l(Ux,U 2 ;V D ). 



\im.l(Ux;Vx) =l(B;Yx), 

e-S>0 

limI(tf 2 ;Vb|tfi) =l(X s ;Y D \B), 
liiaI(Ux,U 2 ;V D ) =l(B,X s ;Y D ), 



(30) 
(31) 
(32) 

(33) 
(34) 
(35) 



then it would follow that for any rate R = R\ + R 2 that satisfies the conditions of Theorem[T] we can find an e > 
small enough such that R\ ^ R\ and R 2 ^ R 2 , for some R\ and R 2 satisfying (30 1, (31 ), and ( |32| ). This, together 
with Theorem[3] would imply that the rate R = Rx + R 2 is achievable. 



In order to prove (33 1 we will show that lim e _>.o h(^i) = K^i)- To that end, we note that 



P Vl (x) = 5- 



+ (1-8)- 



- X y(2( gi e+X)) 



^27r( 9l a 2 + 1) V 2 <9it + 1) 

converges to Py^x) as e — > for every x. In addition, the continuity of the log 2 (-) function implies that 

Pv 1 (x)log 2 (Pv 1 (x)) converges to Py 1 (x) log 2 (iVi {%)) as e — )• for every x. 

Letting 



9(x) 



A = max 
2 



2 max(«5, 1 - 5) 



l0gi V2vr 
max(5, 1 — 8)e 2<9 i c 



log 2 



min(5, 1 — 5) 



^{ gi a 2 + 1) 

J2 



v A + 



2 In 2 
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we have that \Py 1 (x) log 2 (Py 1 (x))\ ^ g(x) for e < a 2 for all x. Since g(x) is integrable, the dominated convergence 
theorem implies that lim^o h(Vi) = h(Yi). 



We then compute h(Vi|i7i) 



l-<5 



log 2 (27re( 5 ie + l)) + f lo g2 (2vre( 5 ia 2 + l)) implying that lim e _, h(Fi|Z7i) 



log 2 (27re) + § log 2 (27re(5ia 2 + 1)) = h(Yi|B). As a result, (|33j> follows. 



Finally, we note that the distribution of Vd does not depend on e so by direct calculation we obtain that 
l(U 2 ; V D \Ui) = I(Xs; Y D \B) and l(Ui, U 2 ; V D ) = 1(B, X s ; Y D ) for any e, and in particular, in the limits ([34l and 



Appendix B 
Proof of Theorem[2] 

In Theorem[3} let K = 2, U\ ~ Uniform[0, 1]. Set U 2 \{U 1 G [0, 1 - Si - S 2 ]} ~ A/"(0, e), f7 2 |{E/i G (1 - <5 X - 
«5 2 , 1 - ^]} ~ AA(0,a 2 ), C7 2 |{C7i G (1 - <5 2 , 1]} ~ M(0,o%), for ft, <5 2 > 0, < 5 X + S 2 ^ I, < e < 1 and 
<5icr 2 + S 2 a\ + (1 - Si - 5 2 )e < 1. Also let /(l) = . . . = /(iV) = 1, and define 

3 ; if U x G [0, 1 - <5i - <J 2 ] 

«>i(tfi,^) = 4 ; if J7i G (1 — <5i — 5 2 , 1 — <5 2 ] , 

; if U x G (1 - 5a, 1] 



with b\Ki\(gia\ + 1) + S 2 K i2 {giu\ + 1) < 1, i 



i?i + R 2 is achievable if 



(36) 
(37) 
(38) 



l,...,N. Then, i? = 

Ri<i{Ur,vi), 

R 2 <l(U 2 ;V D \U l ), 
R 1 + R 2 <l{U ll U 2 -V D ). 

As was done in the proof of Theorem[T] by choosing an appropriate integrable upper bound for | Py 1 (x) log 2 (Py 1 (x))\ 
that is independent of e, we can use the dominated convergence theorem and direct calculations to show that 

limI(C/i;Fi) =I(T;Yi), 

\imIiU2-VD\Ui) =1{X S ;Y D \T), 

limI(l7i,l7 2 ;Vb) =I(T,X 5; y D ). 

Thus, for any rate R = R\ + R 2 that satisfies the conditions of Theorem 2] we can find an e > small enough 
such that R\ ^ R\ and R 2 ^ i? 2 for some R\ and i? 2 satisfying (36 1, (|37l, (38 1, and as a result, by Theorem 
the rate R = R\ + R 2 is achievable. 



Appendix C 
Proof of Theorem[3] 



The following definition and basic properties of typical sets can be found in |7] and will be used in the proof 
of Theorem[3] 

Definition 1. For a finite collection of real random variables {X\, X 2 , ■ ■ ■ , X k ) with joint probability density function 
/x(xi,x 2 , . . . , Xk) let S denote an ordered subset of the random variables and consider n independent identically 
distributed copies of S. The e-typical set Ai 1 of the n-sequences (x™,x 2 , . . . , x?) is defined as 

Ai n \x 1 ,X 2 ,...,X k ) = 

-log/ x (s™)-h(S) 



Wr,xS,...,xJ): 
Proposition 5. For any e > 0, and for sufficiently large n, 

Pr{4")(S)} > 1-e, VSQ{X 1 ,X 2 ,...,X k }. 



<e,VSC{l 1 ,I 2) ...,X,} . 



(39) 
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Proposition 6. Let Aj 1 (Xi, X 2 ) be the typical set corresponding to the probability density function fxi,x 2 {x\,x 2 ) 
and let (X\,X 2 ) have probability density function 

Then 

Pr{(x",x") G A ( £ l) {X l ,X 2 )} = 2~ n ^ x ^ ±3e \ 

Proposition 7. For the probability density function fx u x 2 ,x 3 (xi, x 2 , £3) let Ae (X±, X2, X3) be the typical set 
and let (X\, X 2 , X%) have probability density function 

fx u X 2 ,X 3 ( X ^ X ^ X 3) = fx 1 ( x l)fx 2 \X 1 {x 2 \x 1 )fx 3 \X 1 {x3\x 1 ). 

Then 

Pr{(x?,x2,x£) G A<£ l) {X 1 ,X 2 ,X 3 )} = 2 - n W x ^\ x ^\ 

Proof: (of Theorem[3]) 

We communicate over 6 + 1 blocks of length n each. In the first block, the source sends a message in 
{1,..., [2 nfl J} which is received by all relays. The relays remain silent during this first block. In block m, 
1 < m ^ 6, the source encodes a new message in {1, . . . , [2 nfi J} which is received by all the relays, while the 
relays encode signals based on the signals received in block m — 1. Finally, in block 6+1 the source remains silent 
while the relays encode signals based on the signals received in block 6. Assuming that the destination successfully 
decodes all the messages, the communication rate is 6/(6 + l)R for large n, which tends to R as 6 — > 00. We next 
describe and analyze the coding scheme used in each block of length n. To simplify notation, we use index t to 
represent the time index in block m for the signal transmitted at the source and received at the relays, and the time 
index in block m + 1 for the signal transmitted by the relays and received at the destination, for m = 1, . . . , b. 

Code construction: We generate a codebook C\ with 2 nRl random i.i.d. codewords chosen according to P\j 1 . 
For each 11™ ^ G C\, we generate a codebook C 2 ,i 1 with 2 nR2 codewords with symbols u 2: i 1: i 2 [t] chosen indepen- 
dently according to f[/ 2 |[/i=ui, 4l [t]- Proceeding similarly, for each G C u ^ G C 2>il ,..., ^ ^ ^ G 
Cfc-i,ii,i 2 ,...,i fc -2 we generate a codebook Cfc,i 1 ,...,i fe _ 1 with 2 nRk codewords with symbols Uk,i u ...,i k [t] chosen inde- 
pendently according to Pu K \U 1 =u l , H [t] y ...,u k . l =u h - x , H ,,.., ih _^W tor k = 3, . . . , K. 

Each message W G {1,2,..., M} is represented by a i^T-tuple . . . , %k) where ik G {1, . . . , |_2 n *J }. The 
source transmits message (ii, . . . , ix) by sending the codeword u K ^ i G CK,i 1 ,...,i K _ 1 - That is, 

= Enc s (W) = Enc 5 (n, ...,i K ) = {u Ktilr ,. jiK [l], . . . , ur^,...,^ [«]). 

Relay processing: At the end of each block m G {1, ... ,6}, relay j attempts to decode the codewords , 
u 2 ii i 2 ' u /(*)>*i>*2,. -,*/(j) wnere fU) ( as defined above) indicates the highest message level that the relay attempts 

to decode. Decoding consists of finding the unique set of codewords -fir?- , ir+ , , . . . , u™ - -. } in codebooks 

Mi' 2,ii,i 2 3 /u),»i,-.V(i) 
Ci,C 2 ' il , ■ ■ ■ >Cf(j) l l t n 1 iriai are jointly typical with Y", declaring an error if it finds zero or more than one 
such sets. 

Relay j encodes the signal to be sent in transmission block m + 1 based on the received signal Y? and the 
decoded codewords {u^ i k }l=\ m block m as follows 

Xj[t] = En Cj (Yj)[t} = w 3 (u hii [t], . . ■,u fmu __ Jf Jt],Y j [t]), for t = 1, . . . , n. 

Decoding at the destination: The destination attempts to decode the codewords u n {i , u 2ii i2 , VLK,ii,i 2 ,...,iK fr° m 
the received signal in each block Y^, using joint typicality decoding as explained for the relays. Decoding consists 
of finding the unique set of codewords |u+ , u™, , , . . . , u" , , ) in codebooks Ci, C - , . . . , C,^ ■ • that 

b M L 1,11 ' 2,11,12' ' K,l!,...,t K J 11 2,li> ' K,Jl,...,l K _i 

are jointly typical with Y^,, declaring an error if it finds zero or more than one such sets. The decoded message 
in each block is given by Dec(Y^) = (ii, . . . , ijc). 
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Analysis of the probability of error: We find the average probability of error over all the random code construc- 
tions. Due to the symmetry of the random code construction, the conditional probability of error is independent of 
the transmitted messages, i.e. pj™^ = Pr{Dec(Y^) ^ (1, . . . , 1) | message (1, . . . , 1) was sent}, and so WLOG 
we assume that u^- ll l has been sent. 

There is an error whenever some relay or the destination declares an error. For any given block, relay j declares 
an error if the correct codewords {u™ 1; . . . , u™^) x i\ are not jointly typical with Y™, or if incorrect codewords 



u 



i } are jointly typical with Y™ for some . . . , if(j)) /(!,..., 1). Similarly, the destination 



declares an error if the correct codewords {u" 1? . . . , x } are not jointly typical with Y^,, or if incorrect 

codewords {u™^, . . . , u^- i i } are jointly typical with Y^> for some (ii, . . . , %k) ^ (1, . . . , 1). 

In addition, to guarantee the power constraints at the source and the relays be satisfied, if the average power of 
the signal to be transmitted at the source is larger than one, or if the average power of the signal to be transmitted 
at some relay is larger than one, we declare an error and do not transmit the corresponding signals. 

Define the following events 

E R = {relay j successfully decodes all messages up to level f(j), for j = 1, . . . , N} 



E 



D 



E 



ii...if(j) 

e s : 



• . , < )il)i2) ... )iK , Y£) G At\U^ ...,U K , V D )\E R } 
{Wi,>- ■ ■ > u /(i)i ,v,r Y i) e A i n) (Ux, • • • , U m , Vj )} , for j = 1, 



t=i 



E 



R j 



j^EC^KiMi- • • i^wI^iW) 2 > l| , for j = 1, ... , JV. 



Then the probability of error can be upper bounded as follows using the union bound, 

P] n ) =Pr(Error|£ il ) Pi(E R ) + Pr(Error| {E R ) C ) Pi((E R ) c ) 
sC Pr(Error|^) + Pt{(E r ) c ) 
= Pr [( J B 1 D .. 1 ) c UU (ll ...^^ (1 ... 1) ^... 4 J 



+ Pr 



E P U Uf =1 ((^. 1 ) c U U^..,^,)^...!)^.,^ u 



N 



5 r ( , ) + Pr ( E R * 



(il...i K )^(l...l) 

((^.!) C ) + £ Pr 

V 7 (h...V(J))?i(l...l) 

By using Proposition|5j for large n we have Pt((E® ]J C ) < e, and Pr((£^ J ^) c ) < e for j = 1, . . . , iV. By the law 
of large numbers, for n large enough, we also have Pr(.E~J < e, and Pr(E v 3 ) < e for j = 1, . . . , N. 

We bound the sum involving Pr [E? i ) as follows, 



Pi £ 



E ^ (<€.,«) - E pr^^j + .-.+ E^K- 



+ . . . + 2 n ^2" n(I(t/if;VD|c/l '--' C/A "- l) " 4e) 



(40) 



where we used the fact that (u^ , u?^ ia , . . . , ^ j ) is independent of Y^ when «i 7^ 1, the fact that 
(tig l i2 , • . • , u^j f j ) is conditionally independent of Y^> conditioned on when Z2 7^ 1, the fact that 
u x 1 ii 1S conditionally independent of Yjr, conditioned on (11™ l5 Urf 1 1 , . . . , u^ : _ 1 x x ) when %k 7^ 1, and 
Propositions [6] and |7J valid for large n. 
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Ft 

Similarly, we bound the sum involving Pr [E^' i J for j = 1, . . . , N, as follows, 



V Pr (E*> , ) = 

(«i...i /(3) )^(l...l) ii^l,4a,...,i/ W ) 



V Pr(£;;- 3 , ) + 



Pr LE 



li 2 ...if 



+•••+ E Pr Kii V( ,) 

^2 n ^=i )flfc 2- n(I(t/l '-'^«> ; ^ )_3£) + 2 n ^=2^2~" (I(l/2 '-' C/ ^^ ; ^ |c/l) - 4e) 
+ . . . + 2 n ' R/u> 2~ n ( I ( c//(j ' );yj 'l c/lv --' l//<3) - 1 )~ 4e ). (41) 



The exponent of each of the terms in (41 1 gives a rate constraint that needs to be satisfied in order to have 
vanishingly small error probability as n — > 0. Due to the degradedness of the broadcast channel from the source to 



the relays, many of these rate constraints are redundant. The non-redundant rate constraints arising from (41 1 for 



j = 1, . . . , N together with the rate constraints arising from (40 1 are given in ( 18 1. 



Since e is arbitrarily small, when the conditions ( [18] ) are satisfied, there exists a sequence of codebooks with 
average probability of error going to zero as n tends to infinity. As a result, the rate R = Y2k=i ^ * s achievable. 



Appendix D 
Proof of LemmaQ] 

We find the Taylor series of h(Y) for 6 around zero. We will use Leibniz integral rule for improper integrals |[T3ll . 

Proposition 8 (Leibniz integral rule). Let f(x,y) and its partial derivative ^f(x,y) be continuous everywhere. If 
there exist functions g\{x) and gi{x) such that \f(x,y)\ 5j gi(x) , \-§^f(x,y)\ ^ g<i{x) for all x, gi(x)dx < oo 
and g2(x)dx < oo, then 

±J_J(x,y) d x = J_J/ y (x,y)dx. 

In order to apply Proposition[8] we need to verify that the conditions of the proposition apply to the function 

f(y,S) = fr(y,S) In f Y (y, 8), where 



/ q-l \ q-l 

f Y (y,S)= 1-J> e ' y2/i2a2 ° ) /(^o) + ^5 i e- 

V i=l / i=l 



/(V2irai). 



It is easy to verify that f(y, S) and ^f(y, 6) are continuous for i = 1, . . . , q—1. We next find a function g{y) 
such that \f(y,S)\ < g{y) and j^ oo g{y)dy < oo. To that end we first find lower and upper bounds for fy(y,S) 
that are independent of 5. We assume that Y2t=i Si < Sq < 1, and define cr m i n = minj g | ,.. g _n ^% and a max = 



max. 



e{o,...,g-i} a i- 



I -S 



2nao 



- y2/{2a2 ° ] < fv(y,6) < 



2no r , 



We use these bounds to obtain 



g(y) 



-J/ 2 /(2^ ax ) 



27T(7 T1 



max 



In 



1 



In 



2-KGr, 



+ 



2ai 



which is integrable. 

For each % = 1, . . . , q-l, we find a function g^y) such that | gfr/(y, S)\ < g^y) and g-i{y)dy < oo. Noting 
that 

d ^ ( I 



dSi 



f(y,S) 



/ 27TCJo 



+ — L_ e-y*/^) (lnf Y (y,S) + l) 
V 2tvo~ i / 
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we obtain 

9i(y) = , 

Z7TO"o 

which is also integrable. 



1 e -yVV°l) + 1 



2,71(74 



max 



In 



J>o\ 



/ 2~TTao ) 



In 



+ 



2ai 



+ 1 



Having verified the conditions of Proposition[8j we can exchange the order of derivation and integration to 
compute the Taylor series of h(Y) as follows, 



9-1 



h(Y) = h(Y)| 5=0 + J> 



i=l 



dh(Y) 

88,. 



=0 + O(||5||l) 

9-1 
-n ^ — ' 



i=l 

9-1 /-oo 



-/y(y)log 2 [/y(?/)] 

/~ ( - /y GO iog 2 [/r(y)]) | fco + / 
l 9-1 

- log 2 [27r eC T 2 ] + 8itf/o% ~ l)/(2 In 2) + 0(||<J||i). 



<5=0 



+ 0(1151 



d_ 
d5i 



/y(y)log 2 [/y(y)]) _ Jy + 0(\\8\\l) 



5=0 



i=l 



Appendix E 
Proof of Theorem[5] 

In Theorem]^ we set 8\ = fiwfgh, <5 2 = fii\[gh, K\ = hn\\ = hn^x, k 2 = hn\2 = ^22 and we set 71 = ha\, 
7 2 = ha\, for positive constants /3i, /3 2 , 72, 72, «i, ^2 in R + . We know the rate R\ + i?2 is achievable if 

R X <I(T;Yi), 
i? 2 <I(X s ;Y D |r) ) 
i?! + i? 2 < I(T, X 5 ; Yd) = I(T; Y D ) + 1{X S ; Y D \T). 

We also can achieve the maximum sum rate -Ri + R2 if 

R X <I(T;Y), 
fli<I(T;Yb), 
fl 2 <I(X S ;Y D \T). 

We compute the mutual information expressions I(T; Yi) and I(Xs; Yd|T) for small /i. 

The probability distribution of Yi conditioned on T = 1 and T = 2 is equal to A/"(0, ga\ + 1) and A/"(0, ga\ + 1) 



respectively. For the case that T = the distribution of Yi is AA(0, 1). By applying Corollary|2] we have 



and 



I(T;Y! 



1(X S ;Y D \T) 



2 In 2 

2 
h 



Wi (rtig/h - ln[l + ll9 /h\) + /3 2 (722A - Ml + lig/h})) + 0{h 2 ) 



(42) 



log 2 1 + 



2ki + 1 



82, 

+ TT lo S2 



1 + 



4ffq-|^2 

2k 2 + 1 



5//lln 



1 + 



4(g/fc)7i«i 

1+2/ti 



01 + V5M ln 



1 + 



Mg/h)-y 2 K 2 
1+2k 2 



ft 



2 In 2 



+ 0(/i 2 
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Finally to compute I(T; Yd), we note that the distribution of Yd conditioned on T being 0, 1, and 2 is equal to 
jV(0, 1), N(0,4ga±Ki + 2ki + 1) and Af(0,4ga%K 2 + 2k 2 + 1) respectively. Then, by Corollary|2] we have 

Yd) + 2k x - ln[l + 4(g/h)^x + 2«i])+ 

/3 2 (4( 9 // l ) 72K2 + 2k 2 - ln[l + 4{g/h) l2 K 2 + 2«a])) + 0(/> 2 ). (43) 

To complete the proof we need to impose the power constraints at the source and relays: 

tfiof + b 2 a\ = y/g/h('YiPi + 72A2) < 1, 
SiKaigaj + 1) + 6 2 K i2 (gai + 1) = y/g~Jh{B x {l + ngfh)^ + /3 2 (1 + W>0«2) < 1. 



Appendix F 
Proof of Theorem[6] 

The derivation of the achievable rates in the low SNR regime for BSPDF coding in the symmetric parallel relay 
network with N relays is similar to the case of the two relay network. The difference is in the distribution of the 
signal Yd which is the combination of signals from N relays instead of two relays. By applying Theorem[T]we are 
able to achieve any rate R\ + R 2 such that 



Rx <1(B;Y{), 
R 2 <l(X s ;Y D \B) 
Rx + R 2 < I(B, X S ; Y D ) = 1(5; Y D ) + l(X s ; Y D \B) 



(44) 
(45) 
(46) 



, and (46 1 also satisfies the 



Therefore, the maximum sum rate Rx + R 2 that satisfies the conditions ( |44| ), 
conditions 

Rx <l(B;Yx), 

Rx<l(B;Y D ), 

R 2 <I(X S ;Y D \B). 

In Theorem[T] we set 5 = B\[gh such that < 5 ^ 1 and k = Ki < l/(g + 5) for i = 1,2, . . . , N. Let 
0-2 = yj g/h/ + 1. Conditioned on B = 1, the random variable Yi has distribution jV(0, a 2 ) and conditioned on 
B = we have Yi ~ M(0, 1). Thus by Corollary|2] 



l{B ] Yx) = h\g/h+^/gJhp\n 



P 



/g/h + B 



/(2 In 2) + 0(h 2 



At the destination, optimizing over the value of k we get k = l/(g + 5). If we define a D = N 2 ^g/h/(B(g/h + 
Byfgjh)) + N/(g/h + By/g/h) + 1, then conditioned on B = 1 we have Y D 
5 = 0, we have Yd ~ AA(0, 1), thus using Corollary]!] 



A/"(0, cr|j) and conditioned on 



1(5, Yd) = hB^gJhlyNy g /h/(B(g/h + /fy^A)) + iV/(^//i + B^gjh) 



In 



1 + N 2 ^g/h/(B{g/h + /S^sM)) + ^/(ffA + /(2 In 2) + 0(h 



and 



I(X S ;Y D |5) = /i/3V / < 7 //ilog 2 



1 + 



1 + iV/( 5 //t + /3v^) 



After some simplification the theorem follows. 
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Appendix G 
Proof of Theorem[7] 

Using Theorem[TJ we can achieve the rate R\ + R 2 if 

Ri <1{B;Y 1 ) 
R 2 <l{X s ;Y D \B) 
Rt + R 2 <1(B,X S ;Y D ) = l(B;Y D ) + I(X S ;Y D \B). 

Equivalently, we can achieve the maximum sum rate R\ + R 2 if 

Ri<l{B;Y D ) 
R 2 <l(X s ;Y D \B). 

We set 5 = fiyfgh for a positive constant f3 such that < fi\fgh ^ 1, k\ = Kk\ < h/(g + 5) and k 2 = hn 2 < 
h/(h + 5). Let a\ = \/g/h/(3 + 1 and a\ = l/(/3y g/h) + 1. Conditioned on B = 1 the random variable Y\ has 
distribution A/"(0, a\). Conditioned on B = we have Y\ ~ AA(0, 1). Then by Corollary|2] we have 

l(B; Fi) = hp^/gjh(^/gjh/p - ln(^/gjh/p + l))/(21n2) + 0(h 2 ). 

For the destination defining d| = l^/g/h(\/~k\ + \fk 2 ) 2 j ji + k\ + (g/h)k 2 + 1, conditioned on B = 1 we have 
Yd ~ A/"(0, CJ3) and conditioned on B = we have Yd ~ AA(0, 1). Therefore 

l(B; Yd) = 



21n2 



In 



1 + 



A-i + V^)7/3 + &i + f I + 



Finally by Corollary|2j 

I(X s ;Yd|S) 
which completes the proof. 



log 2 1 + 



1 + h + {g/h)k 2 



Appendix H 
Proof of Theorem^] 

We use two layers of coding requiring relay 2 and the destination to decode both layers of the code and while 
requiring relay 1 to decode only the first layer. By Corollary[T] we can show that the rate Ri + R 2 is achievable if 

Rt <I(fl;Yi), 
R 2 <l(X s ;Y 2 \B), 
R 2 <l(X s ;Y D \B), 
R! + R 2 <I(B,X S ; Yd) = 1(5; Y D ) +I(X S ;Y D \B). 
Equivalently, the following conditions are sufficient to achieve the maximum sum rate R\ + R 2 , 

Rt <I(S;Y 1 ), 
Ri<I(B;Y D ), 
R 2 <l(X s ;Y 2 \B), 
R 2 <l(X s ;Y D \B). 

In the following we compute, 1(5; Yi), I(B,Y D ), I(X S ;Y 2 \B), and I(X S ; Y D \B). Set 5 = pyjg~h, and define 
k = Kih, assuming that <5(k//i)(1 + g/5) < 1 in order to satisfy the power constraint at relay 1. 
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Conditioned on B = 1 we have Y x ~ A/"(0, of), Y 2 ~ A/"(0, a\\ and Y D ~ A/"(0, a 2 D ), where ct 2 = ^fgjh/fi + 1, 
0-2 = \j^^Jh) + 1, and o-fj = yfg]h{l + Vk) 2 //3 + k + 1. Conditioned on B = we have Yi ~ A/"(0, 1), 
y 2 ~ AA(0, 1), and Yd ~ AA(0, 1). 

By applying Corollary[2] we get 

Yi) = hp^gjh^g~/h/p - In [l + sffih/{§ /(21n2) + 0(/i 2 ), 

Y D ) = hpyfgjh[^jh{\ + / P + k - In [l + + V^)7£ + «] ) /(2 In 2) + 0(/> 2 ). 

To complete the proof we need to compute I(X^; Y 2 |l?) and IpCs; Yd|S). We have by Corollary[2] 



I(X S ;Y 2 \B) 
l(X s ;Y D \B) 



h(3^/gjh 



hp^g~/h 



log 2 1 + 



log 2 1 + 



^A(l + ^) 2 //3 N 

K + l 



Appendix I 

Derivation of the lower bound on the minimum energy-per-bit 

In order to find a lower bound on the minimum energy-per-bit we normalize the channel gains as g = Pg/No 
and h = jPh/N 

(2 7 + l)P 



£b ^ inf lim 



^o?->o C, 



up 



> inf 



;i + 2 7 )iVom2 



7>0 maxosjp^i mm{#, (g + 7/1(1 - p 2 ))/2, 7/1(1 + p)} 
iV ln2 

sup 7>0 max ^ p <ci mm j ^pi, 2 ( 2 ; +i y , 27+1 J 



(47) 
(48) 
(49) 



For a fixed 7 we can solve the maximization over p 



max min{s, (g + 7/1(1 - /9 2 ))/2, 7/1(1 + p)} = { Vigh 
O^psjl 



< 7/1/5 < I/ 4 
1/4 s; 7/1/5 < 1 

1 < jh/g 



and therefore, 



sup max mm 

7>o o^p^i 



5 g + 7/1(1 -p 2 ) 7 /i(l + p) 



2 7 + 1' 2(27 + 1) ' 2 7 + l 



max < max 



27/1 



max 



, max ■ 



9 



: max 



0< 7 <^ 27 + l'£< 7 <£ 27 + 1' f< 7 2 7 + l 

gh y/^/gh gh 

-, max , 

<7 + 2/i'A <T<f 27 + 1'2^ + h 



max 
max 

gh 
2g+h 

I gh 
8 
gh 

. g+2h 



gh I gh gh 

}g+2h' V ~8~' 2p+/i 
g+2h' 2g+/i J 

0<g/h< 1/2 

1/2 ^g/h^2 
2<g/h 



; 1/2 ^g/h^2 
: otherwise 



30 



References 

[1] B. E. Schein, and R. Gallager, "The Gaussian parallel relay network," IEEE International Symposium on Information Theory (ISIT), 
p. 22, June 2000. 

[2] B. E. Schein, Distributed Coordination in network Information Theory, PhD. dissertation, MIT, October, 2001. 

[3] A. S. Avestimehr, S. N. Diggavi, and D. N. C. Tse, "Wireless information flow: a deterministic approach," IEEE Trans. Inf. Theory, 

Vol. 57, No. 4, pp. 1872-1905, Apr. 2011. 
[4] A. Ozgiir, and S. Diggavi, "Approximately achieving Gaussian relay network capacity with lattice codes," IEEE International Symposium 

on Information Theory (ISIT), pp. 669-673, Jun. 2010. 
[5] Y. Kochman, A. Khina, U. Erez, and R. Zamir, "Rematch and forward for parallel relay networks," IEEE International Symposium on 

Information Theory (ISIT), pp. 767-771, July, 2008. 
[6] S.S.C. Rezaei, S.O. Gharan, and A.K. Khandani, "A new achievable rate for the Gaussian parallel relay channel," IEEE International 

Symposium on Information Theory (ISIT), pp. 194-198, June-July, 2009. 
[7] TM. Cover and J.A.Thomas, Elements of Information Theory, New York: Wiley, 1991. 

[8] U. Niesen and S. Diggavi, "The approximate capacity of the Gaussian N-relay diamond network," IEEE International Symposium on 

Information Theory (ISIT), pp. 259-263, July-August, 2011. 
[9] M. Gastpar and M. Vetterli, "On the capacity of large Gaussian relay networks," IEEE Transactions on Information Theory, vol. 51, 

pp. 765-779, March 2005. 
[10] H. G. Eggleston, Convexity, Cambridge University Press, Cambridge, 1969. 

[11] A. El Gamal, M. Mohseni and S. Zahedi, "Bounds on capacity and minimum energy-per-bit for AWGN relay channels," IEEE 

Transactions on Information Theory, vol.52, no.4, pp. 1545-1561, April 2006. 
[12] S. Avestimehr and D. Tse, "Outage Capacity of the Fading Relay Channel in the Low-SNR Regime," IEEE Transactions on Information 

Theory, vol.53, no.4, pp. 1401-1415, April 2007. 
[13] M.H. Protter and C.B. Morrey, Intermediate Calculus, Second Edition, Springer- Verlag, New York, 1984. 



